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Abstract 

The Birnbaum-Saunders distribution has been used quite effectively to model times 
to failure for materials subject to fatigue and for modeling lifetime data. In this paper 
we obtain asymptotic expansions, up to order n -1 ' 2 and under a sequence of Pitman 
alternatives, for the nonnull distribution functions of the likelihood ratio, Wald, score 
and gradient test statistics in the Birnbaum-Saunders regression model. The asymptotic 
distributions of all four statistics are obtained for testing a subset of regression parameters 
and for testing the shape parameter. Monte Carlo simulation is presented in order to 
compare the finite-sample performance of these tests. We also present an empirical 
application. 

Keywords: Birnbaum-Saunders distribution, Fatigue life distribution, Gradient test, 
Lifetime data, Likelihood ratio test, Local power, Score test, Wald test. 
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CN| ■ 1. Introduction 

m 

The Birnbaum-Saunders (BS) distribution has received c onside rable attention in the 
iy-) • last few years. It was proposed bv lBirnbaum and Saunders! (|1969al la) and is also known 



as the fatigue life distribution. It describes the total time until the damage caused by 
the development and growth of a dominant crack reaches a threshold level and causes a 
failure. The random variable T is said to have a BS distribution with parameters a and 
77, denoted by T ~ BS(a,ri), if its probability density function is given by 

/(i;a,7 ? ) = K (a,r ? )i- 3 / 2 (t + ^)exp|-^^|, t > 0, 

where K,(a,r)) = c-xjp(a~ 2 )/(2ay/2irr]), t(z) = z + z _1 , a > (shape parameter) and 
•q > (scale parameter). It is positively skewed, the skewness decreasing with a. For 
any constant k > 0, it follows that kT ~ BS(a,kr/). It is also noteworthy that the 
reciprocal proper ty holds: T^ 1 ~ BS(a, f/" 1 ), which is in the same family of distributions 
( Saundersl . 1 1974 ). There are several recent articles considering the BS distribution; see 
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for example. IWu and Wond (f20o3 ) .lDiaz-Garcia and Leival ll2005MLemorite et all (120071. 



2008), Balakrishnan et al. (2007), Kundu ct al. (2008) 



iLeiva et aljj200£ 



iGomes et al. 



(|2009t ). lGuiraud et al.l (|2009t ) . ILeiva et al.l(|2009l) . Txu and Tantd(|2010l ). lMeintanid(|201o|) ' 
am ong others. 



Rieck and Nedelmanl ( 1991 ) introduced a log-linear regression model based on the 
BS distribution by showing that if T ~ BS(a,i]), then Y — log(T) has a sinh-normal 
distribution with shape, location and scale parameters given by a, fi — log(ji) and a = 2, 
respectively, say Y ~ SJ\f(a, /i, 2). The regression model proposed by the authors is 
given by 



yi =x t (3 + £ t 



t=l, 



(1) 



where yi is the logarithm of the ith observed lifetime, xj = (xn , . . . , Xi P ) contains the ith 
observation on p covariates (p < n), (3 = (j3\, . . . , f3 p ) T is a vector of unknown regression 
paramet ers, and £j ~ SJ\f(a , 0,2). Diagnostic t ools for the BS regress ion model can be 
found in lGaleaet all (|2004l) . IXi and Weil (|2007l ) and ILeiva et~all (J20071 ). 

In the BS regression model hypothesis testing inference is usually performed using 
the likelihood ratio, Rao score and Wald tests. A ne w criter i on fo r testing hypothesis, 
referred to as the gradient test, has been proposed bv iTerrelll ( 2002 ). Its statistic shares 
the same first order asymptotic properties with the likelihood ratio, Wald and score 
statistics a nd is very simple when compared with the other three classic tests. In fact, 
Raol ( 20051 ) wrote: "The suggestion by Terrell is attractive as it is simple to compute. It 
would be of interest to investigate the performance of the [gradient] statistic." To the 
best of our knowledge, however, there is no mention in the statistical literature on the 
use of the gradient test in BS regressions. 

In this paper we compare the four rival tests from two different points of view. First, 
we invoke asymptotic arguments. We then move to a finite-sample comparison, which is 
accomplished by means of a simulation study. Our principal aim is to help practitioners 
to choose among the different criteria when performing inference in BS regressions. 

On asymptotic grounds, it is known that, to the first order of approximation, the like- 
lihood ratio, Wald, score and gradient statistics have the same asymptotic distributional 
properties either under the null hypothesis or under a sequence of local alternatives, i.e. a 
sequence of Pitman alternatives converging to the null hypothesis at a convergence rate 
n -1 ' 2 . On the other hand, up to an error of order n _1 the corresponding criteria have 
the same size properties but their local powers differ in the n -1 ' 2 term. A meaningful 
comparison among the criteria can be performed by comparing the nonnull asymptotic 
expansions to order n^ 1 ' 2 of the distribution functions of these statistics under a sequence 
of Pitman alternatives. In t his regard, we can benefit from the work bylHavakawal (|1975T ). 
Harris and Peers! (|1980h and lLemonte and Ferraril (|2010ah . lHavakawal (|1975T ) derived the 
nonnull asymptotic expansions up to order n -1 ' 2 for the densities of the likelihood ra- 
tio aiid_WaJd_s_tadstics 1 _while an analogous result for the score statistic was obtained 
bv lHarris and Peers! (|1980n . Recently, the asympto tic expansion up to o r der n" 1 / 2 for 
the density of the gradient statistic was derived bv iLemonte and Ferraril ( 2010al ). The 
expansions obtained by these authors are extremely general but it can be very difficult 
or even impossible to particularize their formulas for specific regression models. As we 
shall see below, we have been able to apply their results for the BS regression model. 

The rest of the paper is organized as follows. Section [5] briefly describes the likelihood 
ratio, Wald, score and gradient tests. In Section [3] these tests are applied for testing 



hypotheses on the parameters of the BS regression model. In Section [4] we obtain and 
compare the local powers of the tests. Monte Carlo simulation results on the finite-sample 
performance of the tests are presented and discussed in Section [5] Section |B] contains an 
application to a real fatigue data set. Finally, Section [7] discusses our main findings and 
closes the paper with some conclusions. 

2. Background 

Consider a parametric model f(-;9) with corresponding log-likelihood function £(9), 
where 9 = (9j,9j) T is a k-vectoi of unknown parameters. The dimensions of 9\ and 
9 2 are k and k — q, respectively. Suppose the interest lies in testing the composite null 
hypothesis Ho : 9 2 = 9 2 ' against Hi : 9% ^ 6 2 , where 9 2 is a specified vector. 
Hence, 9\ is a vector of nuisance parameters. Let Ug and Kg denote the score function 
and the Fisher information matrix for 9, respectively. The partition for 9 induces the 
corresponding partitions 

U e = (Ul,Uj 2 ) T , Ke=(j C ™ K ™), ^ e 1= ( K2 i K 22 

where Kg 1 is the inverse of Kg. 

Let 9 — (9j , 9 ^ ) T and 9 — (9j, 9 2 ' ) T denote the maximum likelihood estimators 
of 9 = (9j ,9 2 ) T under Hi and Ho, respectively. The likelihood ratio (Si), Wald (5*2), 
score (6*3) and gradient (S4) statistics for testing Ho versus Hi are given by 

Si = 2{£(9) - 1(6)}, S 2 = (0 2 - 9f ) ) T K 22 ' 1 (9 2 - 0< o) ), 

S 3 - Ul K 22 Ug 2 , S 4 - Ul (9 2 ~ 9 { 2 0) ) , 

respectively, where K 22 = K 22 (9), K 22 = K 22 (9) and Ug 2 = Ug 2 (9). The limiting 
distribution of Si, S 2 , S3 and S4 is x\- q under Ho and xl-q A' i- e - a noncentral chi-square 
distribution with k — q degrees of freedom and an appropriate noncentrality parameter 
A, under Hi- The null hypothesis is rejected for a given nominal level, 7 say, if the test 
statistic exceeds the upper 1 — 7 quantile of the xt-q distribution. Clearly, S4 has a very 
simple form and does not involve knowledge of the information matrix, neither expected 
nor observed, and any matrix, unlike S 2 and S3. 

3. Test statistics in the BS regression model 

In what follows, we shall consider the tests which are based on the statistics Si, S 2 , 
S3 and 54 in the class of BS regression models for testing a composite null hypothesis. 
The log-likelihood function £(9) for the vector parameter 9 — (/3 T ,a) T from a random 
sample y = (1/1, . . . ,y n ) T obtained from model (TTJ, except for constants, can be written 
as 



n 1 n 

^)=5>g(£ 4l )-^E£ 



2 
)2- 



where &i = £n(9) = 2a" 1 cosh([y l - m]/2), £ i2 = S, i2 (9) = 2a _1 sinh([y. 4 - m}/2) and 
fj,i = xj j3, for i = 1, . . . ,n. It is assumed that the model matrix X = (xi, . . . ,x n ) T 
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has full column rank, i.e., rank(X) = p. The score function and the Fisher information 
matrix for — ((3 , a) are, respectively, given by 

U e = (Uj,U a ) T , K e = dmg{Kp, K a }, 

where Up = (l/2)X T s, U a = -n/a + (1/a) £™ =1 &, */s = ^)(X T X)/4, K a = 
2?i/a 2 , s = (si, . . . , s„) T with Sj = &1&2 - W61 and V(a) = 2 + 4/a 2 - (%/27r/a){l 



erf (■v/2/a)|exp(2 /a 2 ), erf(-) denoting the erro r function: erf (a;) = (2/y/ff) J* e * dt 
(see, for instance, iGradshtevn and Rvzhikl . 12007]). From the block-d iagonal form of Ke 



we have that (3 and a are globally orthogonal (|Cox and Reidl . Il987l) 

The hypothesis of interest is Ho : (3% = (3 2 , which will be tested against the 
alternative hypothesis Hi : (3 2 7^ /^ j where /3 is partitioned as (3 = (f3j,(3j) T , 

with /3i = (Pi, ■ ■ ■ ,Aj) T and /3 2 = (Pq+i, ■ ■ • , P P ) T ■ Here, /3g is a fixed column 
vector of dimension p — q. The partition for (3 induces the corresponding partitions 
Up = (Ul , f/J 2 ) T , with Up, = (l/2)Xjs and Up 2 = (l/2)X 2 T s, 

K ,'JCpn X^ _ ^(a) ^Tlj XjX 2 



K/321 Kp22J 4 \-X" 2 Xi X 2 X 2 

with the matrix X partitioned as X = (Xi X2). The likelihood ratio, Wald, score 
and gradient statistics for testing Ho can be expressed, respectively, as 

Si = 2{£(§) - 1(9)}, S 2 = ^(& - f3i o) ) T (R T R)0 2 - (3^), 

S3 = ^ ) s T X 2 (R T R)- 1 Xjs, S 4 = \s T X 2 (p 2 - /3f), 

where = (J3[ ,(3j ,a) T , = ( ( 3 1 r ,/3^ )T ,5) T , R = X 2 Xipf^Xi)- 1 ^^ and 
s = s(0). The limiting distribution of all these statistics under Ho is X?„. Notice that, 
unlike the Wald and score statistics, the gradient statistic does note involve the error 
function. 

Now, the problem under consideration is that of testing a composite null hypothesis 
Ho '■ ot = a^ against Hi : a ^ cr% where a^ is a positive specified value for a, and (3 
acts as a nuisance parameter. The four statistics are expressed as follows: 

__ / ^ _ (0) \ 2 

Si = 2{l(p,a)-l((3,a^)}, S 2 = 2n( - J* j , 

$3 = g ' S * = n(6 " 1] { a (0) 

where £ 2 = 6(0) = (l/»)E?=i&(0)> with * = 03 T ,") T and * = (/3 T ,a (0) ) T repre- 
senting the unrestricted and restricted maximum likelihood estimators of under Hi 
and Ho, respectively. 



4. Local power 

In this section we shall assume the following local alternative hypothesis Hi n : 02 = 
(3 2 + e, where e = (e a +i, . . . , e a ) T with e r = Oin -1 ^ 2 ) for r = q + 1, • . -,p. We follow 



the notation in iFerrari et al.1 (J1997I ) . Let 



-(t<> M = ( M °°°) 



where 

A ? = { K i 1 o)' M P = K^-A P . 

It then follows that m ra = m ar = m aa = 0, a ra = a ar = 0, for r = l,...,p, and 
a aa = K^ a — a 2 /(2n), where m ra and a ra are the (r,p + 1) elements of the matrices M 
and A, respectively, and m aa and a aa are the (p+ l,p+ 1) elements of the matrices Ai' 
and A, respectively. Additionally, let 







J-p—q I ' 



where ip- 9 is a (p — q) x (p — g) identity matrix. 

The nonnull distributions of the statistics Si, S2, S3 and S4 under Pitman alternatives 
for testing Ho : /3 2 = p 2 in the BS regression model can be expressed as 

3 
PrO£ < a;) = Gp- q , x (x) + J^ hkG p ^ q+2ktX (x) + O^- 1 ), i = 1, 2, 3, 4, 

fe=0 

where G m ^(^) is the cumulative distribution function of a non-central chi-square variate 
with m degrees of freedom and non-centrality parameter A. Here, A = e* T Kge* and the 
coefficients fo^'s (i = 1, 2, 3,4 and fc = 0, 1, 2, 3) can be written as 

1 p 1 p 

bu = —— 2_^ ( K rst — 2K r:S:t )e*e*e( — — 2_^ i K rst + 2K r:St )a rs £j 

r,s.i— 1 r.s,i— 1 

-, P P -, P 

^ » ^^ ^^ (Krst + K r,st)£r£ s e t ~ ^ 2_J^ Kaat "t" 2 K a,at) K a,a e t > 
r=g+ls,t=l t=l 

1 P 

fel2 = ~g ^ Kr > s -* e '- e * e *' &13 = °' 

1 P P -, P 

&21=—r X! ( Krst + 2K r,st) e r e s e t + ^ ^ r , st m r s £* t - - ^ (n rst + 2K r . st )K r ' S £* t 
r,s,i— 1 r,s,f — 1 r,s,£— 1 

-, P P -, P 

r=g+l s,i=l t=l 

5 



1 p 1 p 1 p 

b22= 2 ^ Kr < st€ * e *s e t + 2 51 K «' m « e *. & 23 = g J! K "* e * e * e 4' 
r,s,t= 1 r,s,t=l r,s,t=l 

1 p 1 p 1 p 

&31 = — ~ /_^ ( K rst ~ 2K riS ^)e*e*ej + - ^^ n T ^^ra TS e* t — — 2_^ ( K rst + ^K r ^ s t)a rs e* t 

r,s y t= 1 r,s,i— 1 r,s,i— 1 

^ V V , P 

— ~ ^^ ^^ («Vst + K r , st )e r e s e t — — 2_^{ K aat + ^ K a,at)K aa e t , 
r=g+l s,t=l t=l 

1 P 1 P 

&32 = -X J^ K rtS jm rs e* t , b 33 = -- ^ K r,s,t£r e *s e t > 



Z / (j 

r,s,i— 1 r,s,t— 1 



P -I P 



1 1 1 

&41 = t 2_/ K rstK r ' s 4 — 2 XJ ( Krs * + 2K r,st)e*e*£t ~ 7 2_, ( 4k ^s* + 3^ st )a rs £t 

r,s,i— 1 r,s,i— 1 r,s,i— 1 

-, P P , P 

" 7 / , , / y ( K rst + K r;S t)e r e s e t — — 2_^\ K aat + 2 K a,at) K a,a € t > 



r =g+l 8 ,t=l 4=1 



1 P 1 P 1 P 



r,s,t= 1 



where the k's are defined as n rs = E(d 2 t(6)/df3 r dp s ), n rst = E(d 3 £(0)/dp r d/3 s d/3 t ), 
Krst = E{{de{e)/dp r ){d 2 £(e)/dp s df3 t )} 1 K aat = E(d 3 £{0)/da 2 df3 t ), etc., and K r - S is 
the (r, s)th element of inverse of Kp. The coefficients bio are obtained from b^ = 
-(bii+bi2 + bi 3 ), fori = 1,2,3,4. 

After some algebra, it is possible to show that, in the BS regression model, 

Krst = K r,st — Kr,s,t — ^aat = K aa,t = U, T, S, t = 1, . . . , p. 

Therefore, bn~ = for i = 1, 2, 3, 4 and k = 0, 1, 2, 3, and we can write 

Pv(Si <x) = G p _ qtX {x) + Oin- 1 ), 2 = 1,2, 3, 4. 

This is a very interesting result, which implies that the likelihood ratio, score, Wald and 
gradient tests for testing the composite null hypothesis Ho '■ &i = P2 have exactly the 
same local power up to an error of order n~ x . 

We now turn to the problem of testing hypotheses on a, the shape parameter. The 
nonnull asymptotic distributions of the statistics S\, S%, S 3 and S4 for testing Ho : a — 
aj(°) under the local alternative H\ n : a = a^ + e, where e = a — a^ is assumed to be 
Oin- 1 / 2 ), is 

3 
Pr(S 4 < a:) = G hX (x) + ^ b lk G l+2KX {x) + 0( n - 1 ), 2 = 1,2, 3, 4, 

fc=0 

with A = 2ne 2 /a 2 . The bi^s for the test of Ho '■ a — a^ are easy to obtain and 
are given by 6n = {n aaa - 2K a . aia )e 3 /6 + YZs=i( K rsa + 2n r . sa )K r - s e/2 - {n aaa + 

6 



Kct,cra)e /2, 6l2 — K a ,a,a£ /6, 613 — 0, 621 — (n a aa + 2n a ^ aa )€ /2 — K a ^ aa K, a,a e + 
Z_<r s=l \ K rsa T 2,K r ^ sa )K ' £/2 T (^ aQQ + ZKq, :QQ ,Jk : €/2 (K QaQ + K Q , :QQ ,Je /2, O22 = 

(*>a,aaf TKaaa'' ' £j/A ^23 == K aaaf /O, O31 = (K QQa ^^a,a,aj e /O Ka,a,a K ' £/-^T 
2—ir s =l\ Krsa ~T~ r , sa ) K ' V ( K aaoTKa iaa )e /2, O32 = H aaa K ' €/ 2, O33 = K QjQjQ € /O, 

"41 = — 2^ r s= i K rsa K ' e/4 — K aQQ K ' e/4 + K a ,aa£ /2 + 2_, r s= i(4K r:SQ + OK rsa )K ' €/4, 

^42 = K aQQ K a ' Q e/4— (k qqq + 2n a , aa )e 3 /4 and 643 = K QQQ e 3 /12, where K Q,a = a 2 /(2n). 
The coefficients fe^o ar e obtained from b^ = — (bn + ba. + ^3), for i — 1,2, 3, 4. We have 
that 

_ lOra 6n _ 8n _ _(2 + a 2 )^ 

^aaa — ^~ ; ^a.aa — T; ^q a. a — Ti Krsa ^r.sa o / ^ir^is: 

a a q,j a -i a a ± — / 

i—1 

for r,s = 1, . . . ,p. After some algebra the coefficients bik reduce to 

3ne 3 2n(2 + a 2 )e , 4ne 3 , , 3ne 3 5e 2n(2 + a 2 )e 

611 = — — , 612 = , &13 = 0, 621 = 1 — — j 

a 3 a 3 tp(a) 3a 3 ' ~ a 3 2a a 3 ip(a) 



3ne 3 5e 5ne 3 3ne 3 2e 2p(2 + a 2 )e _2e 

or la. oa^ a a a a^"ip\a) a 



4ne 3 3ne 3 5e 2p(2 + a 2 )e be ne 3 bne 3 

°33 = „ 5 ; 041 = ~ -; 5 , / s — ! o 42 = h 7—7, 043 — 



3a 3 ' a 3 4a a 3 ip{a) 4a 2a 3 ' 6a 3 

It should be noticed that the above expressions depend on the model only through a and 
the rank of the model matrix X; they do not involve the unknown parameter (3. 

We now present an analytical comparison among the local powers of the four tests 
for testing the null hypothesis Hq : a = a^ - 1 . Let IT be the power function, up to order 
n^ 1 / 2 , of the test that uses the statistic Si, for i = 1, 2, 3, 4. We have 

3 
IT - n, = J2( b Jk - b lk )G l+2KX {x), (2) 

for i 7^ j. It is well known that 

G m ,\(x) - G m+2 ,\{x) = 2g m+2 ,\(x), (3) 

where g v ,\{x) is the probability density function of a non-central chi-square random 
variable with v degrees of freedom and non-centrality parameter A. From Q and ([3]), 
we have 

5e 10ne 3 f 4e 8ne 3 1 

rii - n 2 = — 55,A(a>) + - gr,\( x )i ni — 1I3 = — -s — g5,\(x) + -z-*gi,\{x) >, 

a da a 1 a da* J 

5e , s 5ne 3 , A „ „ 3e , , ne 3 



n i - n 4 = -j — 3 5 ,A(a;) + y-3-37, a (x) >, n 3 - n 4 = — g 5i A (x) + — g-57,A(*). 
Hence, we arrive at the following inequalities: II3 > II4 > IIi > II2 if a > cr°\ and 

n 3 < n 4 < n x < n 2 if a < a®. 

The most important finding obtained so far is that the likelihood ratio, score, Wald 
and gradient tests for testing the null hypothesis Ho : (3% = p% share the same null 
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size and local power up to an error of order n^ 1 . To this order of approximation the 
null distribution of the four statistics is Xp-q- Therefore, if the sample size is large, type 
I error probabilities of all the tests do not significantly deviate from the true nominal 
level, and their powers are approximately equal for alternatives that are close to the null 
hypothesis. 

The natural question now is how these tests perform when the sample size is small 
or of moderate size, and which one is the most reliable. In the next section, we shall use 
Monte Carlo simulations to put some light on this issue. 

5. Finite-sample performance 

In this section we shall present the results of a Monte Carlo simulation in which we 
evaluate the finite sample performance of the likelihood ratio, Wald, score and gradient 
tests. The simulations were based on the model 

y t = frxn + /3 2 x i2 H h /3 p x ip + e. t , 

where Xn = 1 and £j ~ SAf(a, 0, 2), i = l,..,,n. The covariate values were selected 
as random draws from the uniform U(0, 1) distribution and for fixed n those values 
were kept constant throughout the experiment. The number of Monte Carlo replications 
was 15,000, the nominal levels of the tests were 7 = 10%, 5% and 1%, and all s imula- 



tions were performed using the Ox matrix programming language ( Doornild . 120071 ) . Ox 
is freely distributed for academic purposes and available at http://www.doornik.com. 
All log-likelihood maximizations with respect to f3 and a were carried out using the 
BFGS quasi-Newton method with analytic first derivatives through MaxBFGS subroutine. 
This method is generally reg arded as the best-performing nonlinear optimization method 



( Mittelhammer et alll2000l p. 199). The initial values in the iterative BFGS scheme were 



j3 = (X T X) 1 X T y for j3 and \fo? for a, where 



2_ V.,^^'-^/ 3 



a ■= — y sinh 

(=1 



n *-^ 



First, the null hypothesis is Ho : /3 p _i = j3 p = 0, which is tested against a two-sided 
alternative. The sample size is n = 25, a = 0.5, 1.0 and p = 3, 4, . . . , 7. The values of the 
response were generated using fix = ■ ■ ■ = $ v -i = 1. The null rejection rates of the four 
tests are presented in Table [TJ It is evident that the likelihood ratio (Si) and Wald (S 2 ) 
tests are markedly liberal, more so as the number of regressors increases. The score (S3) 
and gradient (S4) tests are also liberal in most of the cases, but much less size distorted 
than the likelihood ratio and Wald tests in all cases. For instance, when a = 0.5, p = 6 
and 7 = 5%, the rejection rates are 10.12% (Si), 12.77% (S 2 ), 7.12% (S3) and 7.32% 
(S4). It is noticeable that the score test is much less liberal than the likelihood ratio 
and Wald tests and slightly less liberal than the gradient test. The score and gradient 
tests are slightly conservative in some cases. Additionally, the Wald test is much more 
liberal than the other tests. Similar results hold for a = 1.0. Table [2] reports results 
for a — 0.5 and p — 5 and sample sizes ranging from 15 to 200. As expected, the null 
rejection rates of all the tests approach the corresponding nominal levels as the sample 
size grows. Again, the score and gradient tests present the best performances. 



Table 1: Null rejection rates (%); a = 0.5 and 1.0, with n = 25. 
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8.76 


12.53 


4.92 


6.05 


2.35 


4.59 


0.64 


0.93 


6 


16.74 


20.65 


12.28 


13.86 


9.77 


13.61 


5.77 


6.89 


2.71 


5.39 


0.84 


1.09 


7 


18.67 


22.90 


13.49 


15.51 


11.27 


15.59 


6.77 


8.03 


3.55 


6.59 


1.00 


1.47 



Table 2: Null rejection rates (%); a = 0.5, p = 5 and different sample sizes. 
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Figure 1: Power of four tests: n = 25, p = 4, a = 0.5 and 7 = 5%. 

We now turn to the finite-sample power properties of the four tests. The simulation 
results above show that the tests have different sizes when one uses their asymptotic \ 2 
distribution in small and moderate-sized samples. In evaluating the power of these tests, 
it is important to ensure that they all have the correct size under the null hypothesis. 
To overcome this difficulty, we used 500,000 Monte Carlo simulated samples, drawn 
under the null hypothesis, to estimate the exact critical value of each test for the chosen 
nominal level. We set n = 25, p — 4, a — 0.5 and 7 = 5%. For the power simulations 
we computed the rejection rates under the alternative hypothesis P3 = /?4 = 5, for S 
ranging from —2.0 to 2.0. Figure [1] shows that the power curves of the four tests are 
indistinguishable from each other. As expected, the powers of the tests approach 1 as 
\S\ grows. Power simulations carried out for other values of n, p and 7 showed a similar 
pattern. 

Overall, in small to moderate-sized samples the best performing tests are the score 
and the gradient tests. They are less size distorted than the other two and are as powerful 
as the others. 

We also performed Monte Carlo simulations considering hypothesis testing on a. To 
save space, the results are not shown. The score and gradient tests exhibited superior 
behavior than the likelihood ratio and Wald tests. For example, when n — 35, p = 4, 
7 = 5% and W. : a = 0.5, we obtained the following null rejection rates: 9.99% (Si), 
15.89% (Sa), 5.29% (S3) and 6.99% (S 4 ). Again, the best performing tests are the score 
and gradient tests. 



6. Application 

This application focuses on model ing the die lifetime (T) in the process of metal 
extrusion. The data were taken from Tepadatu et al.l (|2005l ). According to the authors, 
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"the estimation of tool life (fatigue life) in the extrusion operation is important for 
scheduling tool changing times, for adaptive process control and for tool cost evaluation." 
The authors noted that "die fatigue cracks are caused by the repeat application of loads 
which individually would be too small to cause failure." The BS regression model is then 
appealing in this context since the main motivation for the BS distribution is the fatigue 
failure time due to propagation of an initial crack. 
We consider the following regression model: 

2/i = I3ix u + f3 2 x 2l + fi 3 x 3l + j3 4 X4i + /3 5 x 2l x 3l + /3 6 x 2l x 4:l + /3 7 x 3l x 4l + e t , 

where yi = log(Ti), Si ~ SAf(a, 0,2), Xu — 1 and the covariates are x 2 i (friction co- 
efficient), x 3i (angle of the die) and x 4i (work temperature), for i = 1, ...,15. We 
wish to test the significance of the interaction effects, i.e., the interest lies in testing 
*H-o : k — @6 — Pi = 0. The likelihood ratio, Wald, score and gradient test statistics 
equal 6.387 (p-value: 0.094), 8.039 (p-value: 0.045), 5.144 (p-value: 0.162) and 5.206 
(p- value: 0.157), respectively. Hence, the null hypothesis is rejected at the 10% nominal 
level when inference is based on the likelihood ratio or the Wald test, but the opposite 
decision is reached when either the score or the gradient test is used. Recall that our 
simulation results indicated that the likelihood ratio and Wald tests arc markedly liberal 
in small samples (here, n = 15), which leads us to mistrust the inference delivered by 
the likelihood ratio and Wald tests. Therefore, we removed the interaction effects from 
the model as indicated by the score and gradient tests. 

The model containing only main effects is Hi = fii + P 2 x 2 i + j3 3 x 3 i + PaXu + Si, for 
i = 1, . . . , 15. The null hypothesis Ho ■ fl 3 — is strongly rejected by the four tests at 
the usual significance levels. All tests also suggest the individual and joint exclusions 
of the friction coefficient and angle of the die from the model. Hence, we end up with 
the regression model j/j = /3i + P^x^i + £», for i = 1, . . . , 15. The maximum likelihood 
estimates of the parameters are (standard errors in parentheses): fix — 6.2453(0.326), 
/3 4 = 0.0052 (0.001) and a = 0.2039 (0.037). 

7. Discussion 

The BS regression model is becoming increasingly popular in lifetime analyses and 
reliability studies. In this paper, we dealt with the issue of performing hypothesis testing 
concerning the parameters of this model. We considered the three classic tests, likelihood 
ratio, Wald and score tests, and a recently proposed test, the gradient test. For the 
discussion that follows, let us concentrate on tests regarding the regression parameters, 
which arc, in general, of primary interest. 

The four tests have the same distribution, under cither the null hypothesis or a 
sequence of local alternatives, up to an error of order n _1 , as we showed. Our Monte 
Carlo simulation study added some important information. It revealed that the likelihood 
ratio and the Wald tests can be remarkably oversized if the sample is small. The score 
and the gradient tests are clearly much less size distorted than the other two tests. Our 
power simulations suggested that all the four tests have similar power properties when 
estimated correct critical values are used. Overall, this is an indication that the score 
and the gradient tests should be prefered. 
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At this point, a discussion on small-sample corrections for the classic tests is in or- 
der. A Bartlett correction for the likeli hood ratio statis t ic and a Ba rtlett-type correction 
for the score statistic were derived in iLemonte et al.l ( 20101 ) and iLemonte and Ferrari 



(|2010bl ). respectively; see also Cordeiro and Ferrari (1991). The corrected statistics have 
the following interesting properties: (i) the uncorrected and corrected statistics have 
the same asymptotic distribution under the null hypothesis; (ii) the order of the error 
of the approximation for the distribution of the test statistics by \ 2 i s smaller for the 
corrected statistics than for the uncorrected statistics; (iii) the corrections have no effect 
on the n" 1 / 2 term of th e loc al power of the corresponding tests; (iv) simulation results 



the n ' term ot th e loc al power ot the cor responding tests; (ivj simulation results 
Lemonte et al.l ( 2010T ) and ILemonte and Ferraril ( 2010a ) show that these corrections 



reduce the size distortion of the tests and that the best performing test in small and 
moderate-sized samples is the test which uses the corrected score statistic. Therefore, 
the Bartlett-type-corrected score test is an excellent alternative to the tests under con- 
sideration in the present paper. The slight disadvantage of such an alternative is the 
extra computational burden involved in computing the Bartlett-type correction. 

We computed the corrected versions of likelihood ratio and score statistics for the 
hypotheses tested in the real data application presented in Section [6l Recall that the 
hypothesis of no interaction effects is rejected by the likelihood ratio and Wald tests, but 
not rejected by the score and gradient tests. It is noteworthy that the decision reached 
by either the corrected likelihood ratio test or the corrected score test is in agreement 
with that obtained by the later two tests and in disagreement with the likelihood ratio 
and Wald tests, which tend to reject the null hypothesis much more often than indicated 
by the significance level. 

Finally, our overall recommendations for practitioners when performing testing in- 
ference in BS regressions are as follows. The score test or the gradient test should be 
prefered as both perform better than the likelihood ratio and Wald tests in small and 
moderate-sized samples. While the gradient test is a little more liberal than the score 
test, it is easier to calculate. The Bartlett-type corrected score test is a further better 
option although it requires a small extra computational effort. 
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